Overview
Brought to you by YData
Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 21,428 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 7.3 MiB |
| Average record size in memory | 355.4 B |
Variable types
| Categorical | 8 |
|---|---|
| Numeric | 10 |
market_segment_type is highly overall correlated with repeated_guest | High correlation |
no_of_previous_bookings_not_canceled is highly overall correlated with repeated_guest | High correlation |
repeated_guest is highly overall correlated with market_segment_type and 1 other fields | High correlation |
no_of_adults is highly imbalanced (51.5%) | Imbalance |
type_of_meal_plan is highly imbalanced (54.0%) | Imbalance |
required_car_parking_space is highly imbalanced (74.8%) | Imbalance |
room_type_reserved is highly imbalanced (56.7%) | Imbalance |
market_segment_type is highly imbalanced (54.4%) | Imbalance |
repeated_guest is highly imbalanced (79.7%) | Imbalance |
no_of_previous_cancellations is highly skewed (γ1 = 21.9574157) | Skewed |
no_of_children has 19304 (90.1%) zeros | Zeros |
no_of_weekend_nights has 9171 (42.8%) zeros | Zeros |
no_of_week_nights has 1497 (7.0%) zeros | Zeros |
lead_time has 944 (4.4%) zeros | Zeros |
no_of_previous_cancellations has 21221 (99.0%) zeros | Zeros |
no_of_previous_bookings_not_canceled has 20795 (97.0%) zeros | Zeros |
avg_price_per_room has 368 (1.7%) zeros | Zeros |
no_of_special_requests has 9925 (46.3%) zeros | Zeros |
Reproduction
| Analysis started | 2025-05-11 06:44:01.884740 |
|---|---|
| Analysis finished | 2025-05-11 06:44:15.744256 |
| Duration | 13.86 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
no_of_adults
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| 2 | |
|---|---|
| 1 | |
| 3 | |
| 0 | 117 |
| 4 | 14 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 2 |
| 4th row | 3 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 15596 | |
| 1 | 3950 | 18.4% |
| 3 | 1751 | 8.2% |
| 0 | 117 | 0.5% |
| 4 | 14 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 15596 | |
| 1 | 3950 | 18.4% |
| 3 | 1751 | 8.2% |
| 0 | 117 | 0.5% |
| 4 | 14 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 15596 | |
| 1 | 3950 | 18.4% |
| 3 | 1751 | 8.2% |
| 0 | 117 | 0.5% |
| 4 | 14 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 21428 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 15596 | |
| 1 | 3950 | 18.4% |
| 3 | 1751 | 8.2% |
| 0 | 117 | 0.5% |
| 4 | 14 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21428 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 15596 | |
| 1 | 3950 | 18.4% |
| 3 | 1751 | 8.2% |
| 0 | 117 | 0.5% |
| 4 | 14 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21428 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 15596 | |
| 1 | 3950 | 18.4% |
| 3 | 1751 | 8.2% |
| 0 | 117 | 0.5% |
| 4 | 14 | 0.1% |
no_of_children
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1413571 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 19304 |
| Zeros (%) | 90.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 334.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.46468757 |
|---|---|
| Coefficient of variation (CV) | 3.2873309 |
| Kurtosis | 30.682261 |
| Mean | 0.1413571 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.1437565 |
| Sum | 3029 |
| Variance | 0.21593454 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 19304 | |
| 1 | 1257 | 5.9% |
| 2 | 848 | 4.0% |
| 3 | 16 | 0.1% |
| 9 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 19304 | |
| 1 | 1257 | 5.9% |
| 2 | 848 | 4.0% |
| 3 | 16 | 0.1% |
| 9 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 3 | 16 | 0.1% |
| 2 | 848 | 4.0% |
| 1 | 1257 | 5.9% |
| 0 | 19304 |
no_of_weekend_nights
Real number (ℝ)
Zeros 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.87913011 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 9171 |
| Zeros (%) | 42.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 334.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 0.88370138 |
|---|---|
| Coefficient of variation (CV) | 1.0051998 |
| Kurtosis | 0.1472271 |
| Mean | 0.87913011 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.62570335 |
| Sum | 18838 |
| Variance | 0.78092813 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 9171 | |
| 1 | 6090 | |
| 2 | 5925 | |
| 3 | 116 | 0.5% |
| 4 | 95 | 0.4% |
| 5 | 16 | 0.1% |
| 6 | 15 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 9171 | |
| 1 | 6090 | |
| 2 | 5925 | |
| 3 | 116 | 0.5% |
| 4 | 95 | 0.4% |
| 5 | 16 | 0.1% |
| 6 | 15 | 0.1% |
| Value | Count | Frequency (%) |
| 6 | 15 | 0.1% |
| 5 | 16 | 0.1% |
| 4 | 95 | 0.4% |
| 3 | 116 | 0.5% |
| 2 | 5925 | |
| 1 | 6090 | |
| 0 | 9171 |
no_of_week_nights
Real number (ℝ)
Zeros 
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.2556468 |
| Minimum | 0 |
|---|---|
| Maximum | 17 |
| Zeros | 1497 |
| Zeros (%) | 7.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 334.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 17 |
| Range | 17 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4942457 |
|---|---|
| Coefficient of variation (CV) | 0.66244666 |
| Kurtosis | 6.3394119 |
| Mean | 2.2556468 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.488749 |
| Sum | 48334 |
| Variance | 2.2327702 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 5967 | |
| 1 | 5817 | |
| 3 | 4600 | |
| 4 | 1997 | 9.3% |
| 0 | 1497 | 7.0% |
| 5 | 1165 | 5.4% |
| 6 | 147 | 0.7% |
| 7 | 88 | 0.4% |
| 8 | 53 | 0.2% |
| 10 | 34 | 0.2% |
| Other values (7) | 63 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 1497 | 7.0% |
| 1 | 5817 | |
| 2 | 5967 | |
| 3 | 4600 | |
| 4 | 1997 | 9.3% |
| 5 | 1165 | 5.4% |
| 6 | 147 | 0.7% |
| 7 | 88 | 0.4% |
| 8 | 53 | 0.2% |
| 9 | 27 | 0.1% |
| Value | Count | Frequency (%) |
| 17 | 2 | < 0.1% |
| 15 | 6 | < 0.1% |
| 14 | 6 | < 0.1% |
| 13 | 5 | < 0.1% |
| 12 | 5 | < 0.1% |
| 11 | 12 | 0.1% |
| 10 | 34 | 0.2% |
| 9 | 27 | 0.1% |
| 8 | 53 | |
| 7 | 88 |
type_of_meal_plan
Categorical
Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| Meal Plan 1 | |
|---|---|
| Not Selected | |
| Meal Plan 2 | 978 |
| Meal Plan 3 | 4 |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 11.172298 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not Selected |
|---|---|
| 2nd row | Meal Plan 1 |
| 3rd row | Meal Plan 1 |
| 4th row | Not Selected |
| 5th row | Meal Plan 1 |
Common Values
| Value | Count | Frequency (%) |
| Meal Plan 1 | 16754 | |
| Not Selected | 3692 | 17.2% |
| Meal Plan 2 | 978 | 4.6% |
| Meal Plan 3 | 4 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| meal | 17736 | |
| plan | 17736 | |
| 1 | 16754 | |
| not | 3692 | 6.1% |
| selected | 3692 | 6.1% |
| 2 | 978 | 1.6% |
| 3 | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 39164 | |
| 39164 | ||
| a | 35472 | |
| e | 28812 | |
| M | 17736 | |
| P | 17736 | |
| n | 17736 | |
| 1 | 16754 | |
| t | 7384 | 3.1% |
| N | 3692 | 1.5% |
| Other values (6) | 15750 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 139644 | |
| Uppercase Letter | 42856 | 17.9% |
| Space Separator | 39164 | 16.4% |
| Decimal Number | 17736 | 7.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 39164 | |
| a | 35472 | |
| e | 28812 | |
| n | 17736 | |
| t | 7384 | 5.3% |
| o | 3692 | 2.6% |
| c | 3692 | 2.6% |
| d | 3692 | 2.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 17736 | |
| P | 17736 | |
| N | 3692 | 8.6% |
| S | 3692 | 8.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 16754 | |
| 2 | 978 | 5.5% |
| 3 | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 39164 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 182500 | |
| Common | 56900 | 23.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 39164 | |
| a | 35472 | |
| e | 28812 | |
| M | 17736 | |
| P | 17736 | |
| n | 17736 | |
| t | 7384 | 4.0% |
| N | 3692 | 2.0% |
| o | 3692 | 2.0% |
| S | 3692 | 2.0% |
| Other values (2) | 7384 | 4.0% |
Common
| Value | Count | Frequency (%) |
| 39164 | ||
| 1 | 16754 | |
| 2 | 978 | 1.7% |
| 3 | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 239400 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 39164 | |
| 39164 | ||
| a | 35472 | |
| e | 28812 | |
| M | 17736 | |
| P | 17736 | |
| n | 17736 | |
| 1 | 16754 | |
| t | 7384 | 3.1% |
| N | 3692 | 1.5% |
| Other values (6) | 15750 |
required_car_parking_space
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| 0 | |
|---|---|
| 1 | 901 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 20527 | |
| 1 | 901 | 4.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 20527 | |
| 1 | 901 | 4.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 20527 | |
| 1 | 901 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 21428 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 20527 | |
| 1 | 901 | 4.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21428 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 20527 | |
| 1 | 901 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21428 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 20527 | |
| 1 | 901 | 4.2% |
room_type_reserved
Categorical
Imbalance 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| Room_Type 1 | |
|---|---|
| Room_Type 4 | |
| Room_Type 6 | 777 |
| Room_Type 2 | 506 |
| Room_Type 5 | 189 |
| Other values (2) | 127 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Room_Type 1 |
|---|---|
| 2nd row | Room_Type 1 |
| 3rd row | Room_Type 1 |
| 4th row | Room_Type 1 |
| 5th row | Room_Type 1 |
Common Values
| Value | Count | Frequency (%) |
| Room_Type 1 | 15451 | |
| Room_Type 4 | 4378 | 20.4% |
| Room_Type 6 | 777 | 3.6% |
| Room_Type 2 | 506 | 2.4% |
| Room_Type 5 | 189 | 0.9% |
| Room_Type 7 | 122 | 0.6% |
| Room_Type 3 | 5 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| room_type | 21428 | |
| 1 | 15451 | |
| 4 | 4378 | 10.2% |
| 6 | 777 | 1.8% |
| 2 | 506 | 1.2% |
| 5 | 189 | 0.4% |
| 7 | 122 | 0.3% |
| 3 | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 42856 | |
| R | 21428 | |
| m | 21428 | |
| _ | 21428 | |
| T | 21428 | |
| y | 21428 | |
| p | 21428 | |
| e | 21428 | |
| 21428 | ||
| 1 | 15451 | 6.6% |
| Other values (6) | 5977 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 128568 | |
| Uppercase Letter | 42856 | 18.2% |
| Connector Punctuation | 21428 | 9.1% |
| Space Separator | 21428 | 9.1% |
| Decimal Number | 21428 | 9.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 15451 | |
| 4 | 4378 | 20.4% |
| 6 | 777 | 3.6% |
| 2 | 506 | 2.4% |
| 5 | 189 | 0.9% |
| 7 | 122 | 0.6% |
| 3 | 5 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 42856 | |
| m | 21428 | |
| y | 21428 | |
| p | 21428 | |
| e | 21428 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 21428 | |
| T | 21428 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 21428 |
Space Separator
| Value | Count | Frequency (%) |
| 21428 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 171424 | |
| Common | 64284 | 27.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| _ | 21428 | |
| 21428 | ||
| 1 | 15451 | |
| 4 | 4378 | 6.8% |
| 6 | 777 | 1.2% |
| 2 | 506 | 0.8% |
| 5 | 189 | 0.3% |
| 7 | 122 | 0.2% |
| 3 | 5 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| o | 42856 | |
| R | 21428 | |
| m | 21428 | |
| T | 21428 | |
| y | 21428 | |
| p | 21428 | |
| e | 21428 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 235708 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 42856 | |
| R | 21428 | |
| m | 21428 | |
| _ | 21428 | |
| T | 21428 | |
| y | 21428 | |
| p | 21428 | |
| e | 21428 | |
| 21428 | ||
| 1 | 15451 | 6.6% |
| Other values (6) | 5977 | 2.5% |
lead_time
Real number (ℝ)
Zeros 
| Distinct | 351 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 67.349776 |
| Minimum | 0 |
|---|---|
| Maximum | 443 |
| Zeros | 944 |
| Zeros (%) | 4.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 334.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 12 |
| median | 45 |
| Q3 | 101.25 |
| 95-th percentile | 210 |
| Maximum | 443 |
| Range | 443 |
| Interquartile range (IQR) | 89.25 |
Descriptive statistics
| Standard deviation | 69.447828 |
|---|---|
| Coefficient of variation (CV) | 1.0311516 |
| Kurtosis | 1.8205715 |
| Mean | 67.349776 |
| Median Absolute Deviation (MAD) | 38 |
| Skewness | 1.39884 |
| Sum | 1443171 |
| Variance | 4823.0009 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 944 | 4.4% |
| 1 | 733 | 3.4% |
| 2 | 484 | 2.3% |
| 4 | 453 | 2.1% |
| 3 | 451 | 2.1% |
| 5 | 404 | 1.9% |
| 6 | 360 | 1.7% |
| 7 | 314 | 1.5% |
| 8 | 297 | 1.4% |
| 12 | 259 | 1.2% |
| Other values (341) | 16729 |
| Value | Count | Frequency (%) |
| 0 | 944 | |
| 1 | 733 | |
| 2 | 484 | |
| 3 | 451 | |
| 4 | 453 | |
| 5 | 404 | |
| 6 | 360 | 1.7% |
| 7 | 314 | 1.5% |
| 8 | 297 | 1.4% |
| 9 | 248 | 1.2% |
| Value | Count | Frequency (%) |
| 443 | 2 | < 0.1% |
| 433 | 2 | < 0.1% |
| 418 | 5 | |
| 386 | 8 | |
| 381 | 2 | < 0.1% |
| 377 | 6 | |
| 372 | 1 | < 0.1% |
| 361 | 1 | < 0.1% |
| 359 | 6 | |
| 355 | 1 | < 0.1% |
arrival_year
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| 2018 | |
|---|---|
| 2017 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2018 |
|---|---|
| 2nd row | 2018 |
| 3rd row | 2018 |
| 4th row | 2018 |
| 5th row | 2018 |
Common Values
| Value | Count | Frequency (%) |
| 2018 | 18230 | |
| 2017 | 3198 | 14.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2018 | 18230 | |
| 2017 | 3198 | 14.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 21428 | |
| 0 | 21428 | |
| 1 | 21428 | |
| 8 | 18230 | |
| 7 | 3198 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 85712 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 21428 | |
| 0 | 21428 | |
| 1 | 21428 | |
| 8 | 18230 | |
| 7 | 3198 | 3.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 85712 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 21428 | |
| 0 | 21428 | |
| 1 | 21428 | |
| 8 | 18230 | |
| 7 | 3198 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 85712 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 21428 | |
| 0 | 21428 | |
| 1 | 21428 | |
| 8 | 18230 | |
| 7 | 3198 | 3.7% |
arrival_month
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.337269 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 334.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| median | 8 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.1441285 |
|---|---|
| Coefficient of variation (CV) | 0.42851482 |
| Kurtosis | -1.0026178 |
| Mean | 7.337269 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.29729955 |
| Sum | 157223 |
| Variance | 9.8855443 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 2646 | |
| 8 | 2542 | |
| 9 | 2518 | |
| 12 | 1971 | |
| 7 | 1850 | |
| 11 | 1823 | |
| 4 | 1668 | |
| 3 | 1646 | |
| 6 | 1529 | |
| 5 | 1504 | |
| Other values (2) | 1731 |
| Value | Count | Frequency (%) |
| 1 | 656 | 3.1% |
| 2 | 1075 | |
| 3 | 1646 | |
| 4 | 1668 | |
| 5 | 1504 | |
| 6 | 1529 | |
| 7 | 1850 | |
| 8 | 2542 | |
| 9 | 2518 | |
| 10 | 2646 |
| Value | Count | Frequency (%) |
| 12 | 1971 | |
| 11 | 1823 | |
| 10 | 2646 | |
| 9 | 2518 | |
| 8 | 2542 | |
| 7 | 1850 | |
| 6 | 1529 | |
| 5 | 1504 | |
| 4 | 1668 | |
| 3 | 1646 |
arrival_date
Real number (ℝ)
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.74202 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 334.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.8433257 |
|---|---|
| Coefficient of variation (CV) | 0.56176563 |
| Kurtosis | -1.2045586 |
| Mean | 15.74202 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.015919099 |
| Sum | 337320 |
| Variance | 78.20441 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 26 | 784 | 3.7% |
| 2 | 774 | 3.6% |
| 7 | 769 | 3.6% |
| 4 | 756 | 3.5% |
| 19 | 756 | 3.5% |
| 20 | 746 | 3.5% |
| 8 | 743 | 3.5% |
| 29 | 743 | 3.5% |
| 17 | 730 | 3.4% |
| 11 | 728 | 3.4% |
| Other values (21) | 13899 |
| Value | Count | Frequency (%) |
| 1 | 647 | |
| 2 | 774 | |
| 3 | 669 | |
| 4 | 756 | |
| 5 | 699 | |
| 6 | 668 | |
| 7 | 769 | |
| 8 | 743 | |
| 9 | 688 | |
| 10 | 645 |
| Value | Count | Frequency (%) |
| 31 | 400 | |
| 30 | 676 | |
| 29 | 743 | |
| 28 | 718 | |
| 27 | 693 | |
| 26 | 784 | |
| 25 | 679 | |
| 24 | 590 | |
| 23 | 615 | |
| 22 | 662 |
market_segment_type
Categorical
High correlation  Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.3 MiB |
| Online | |
|---|---|
| Offline | |
| Corporate | 1169 |
| Complementary | 274 |
| Aviation | 82 |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.4228113 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Online |
|---|---|
| 2nd row | Online |
| 3rd row | Offline |
| 4th row | Online |
| 5th row | Online |
Common Values
| Value | Count | Frequency (%) |
| Online | 16432 | |
| Offline | 3471 | 16.2% |
| Corporate | 1169 | 5.5% |
| Complementary | 274 | 1.3% |
| Aviation | 82 | 0.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| online | 16432 | |
| offline | 3471 | 16.2% |
| corporate | 1169 | 5.5% |
| complementary | 274 | 1.3% |
| aviation | 82 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 36691 | |
| e | 21620 | |
| l | 20177 | |
| i | 20067 | |
| O | 19903 | |
| f | 6942 | 5.0% |
| o | 2694 | 2.0% |
| r | 2612 | 1.9% |
| a | 1525 | 1.1% |
| t | 1525 | 1.1% |
| Other values (6) | 3872 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 116200 | |
| Uppercase Letter | 21428 | 15.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 36691 | |
| e | 21620 | |
| l | 20177 | |
| i | 20067 | |
| f | 6942 | 6.0% |
| o | 2694 | 2.3% |
| r | 2612 | 2.2% |
| a | 1525 | 1.3% |
| t | 1525 | 1.3% |
| p | 1443 | 1.2% |
| Other values (3) | 904 | 0.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 19903 | |
| C | 1443 | 6.7% |
| A | 82 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 137628 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 36691 | |
| e | 21620 | |
| l | 20177 | |
| i | 20067 | |
| O | 19903 | |
| f | 6942 | 5.0% |
| o | 2694 | 2.0% |
| r | 2612 | 1.9% |
| a | 1525 | 1.1% |
| t | 1525 | 1.1% |
| Other values (6) | 3872 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 137628 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 36691 | |
| e | 21620 | |
| l | 20177 | |
| i | 20067 | |
| O | 19903 | |
| f | 6942 | 5.0% |
| o | 2694 | 2.0% |
| r | 2612 | 1.9% |
| a | 1525 | 1.1% |
| t | 1525 | 1.1% |
| Other values (6) | 3872 | 2.8% |
repeated_guest
Categorical
High correlation  Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.2 MiB |
| 0 | |
|---|---|
| 1 | 678 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 20750 | |
| 1 | 678 | 3.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 20750 | |
| 1 | 678 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 20750 | |
| 1 | 678 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 21428 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 20750 | |
| 1 | 678 | 3.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21428 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 20750 | |
| 1 | 678 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21428 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 20750 | |
| 1 | 678 | 3.2% |
no_of_previous_cancellations
Real number (ℝ)
Skewed  Zeros 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.028467426 |
| Minimum | 0 |
|---|---|
| Maximum | 13 |
| Zeros | 21221 |
| Zeros (%) | 99.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 334.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 13 |
| Range | 13 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.42699297 |
|---|---|
| Coefficient of variation (CV) | 14.999353 |
| Kurtosis | 540.07914 |
| Mean | 0.028467426 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 21.957416 |
| Sum | 610 |
| Variance | 0.182323 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 21221 | |
| 1 | 103 | 0.5% |
| 2 | 33 | 0.2% |
| 3 | 29 | 0.1% |
| 11 | 24 | 0.1% |
| 4 | 9 | < 0.1% |
| 5 | 7 | < 0.1% |
| 13 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 21221 | |
| 1 | 103 | 0.5% |
| 2 | 33 | 0.2% |
| 3 | 29 | 0.1% |
| 4 | 9 | < 0.1% |
| 5 | 7 | < 0.1% |
| 6 | 1 | < 0.1% |
| 11 | 24 | 0.1% |
| 13 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 13 | 1 | < 0.1% |
| 11 | 24 | 0.1% |
| 6 | 1 | < 0.1% |
| 5 | 7 | < 0.1% |
| 4 | 9 | < 0.1% |
| 3 | 29 | 0.1% |
| 2 | 33 | 0.2% |
| 1 | 103 | 0.5% |
| 0 | 21221 |
no_of_previous_bookings_not_canceled
Real number (ℝ)
High correlation  Zeros 
| Distinct | 51 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.19801195 |
| Minimum | 0 |
|---|---|
| Maximum | 58 |
| Zeros | 20795 |
| Zeros (%) | 97.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 334.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 58 |
| Range | 58 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.9586765 |
|---|---|
| Coefficient of variation (CV) | 9.8917087 |
| Kurtosis | 345.7301 |
| Mean | 0.19801195 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 16.704832 |
| Sum | 4243 |
| Variance | 3.8364137 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 20795 | |
| 1 | 181 | 0.8% |
| 2 | 81 | 0.4% |
| 3 | 61 | 0.3% |
| 4 | 55 | 0.3% |
| 5 | 50 | 0.2% |
| 6 | 31 | 0.1% |
| 8 | 19 | 0.1% |
| 7 | 19 | 0.1% |
| 11 | 14 | 0.1% |
| Other values (41) | 122 | 0.6% |
| Value | Count | Frequency (%) |
| 0 | 20795 | |
| 1 | 181 | 0.8% |
| 2 | 81 | 0.4% |
| 3 | 61 | 0.3% |
| 4 | 55 | 0.3% |
| 5 | 50 | 0.2% |
| 6 | 31 | 0.1% |
| 7 | 19 | 0.1% |
| 8 | 19 | 0.1% |
| 9 | 14 | 0.1% |
| Value | Count | Frequency (%) |
| 58 | 1 | |
| 56 | 1 | |
| 54 | 1 | |
| 53 | 1 | |
| 51 | 1 | |
| 50 | 1 | |
| 48 | 2 | |
| 47 | 1 | |
| 46 | 1 | |
| 45 | 1 |
avg_price_per_room
Real number (ℝ)
Zeros 
| Distinct | 3519 |
|---|---|
| Distinct (%) | 16.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 105.52101 |
| Minimum | 0 |
|---|---|
| Maximum | 365 |
| Zeros | 368 |
| Zeros (%) | 1.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 334.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 60 |
| Q1 | 80.75 |
| median | 99.9 |
| Q3 | 126.9 |
| 95-th percentile | 170.33 |
| Maximum | 365 |
| Range | 365 |
| Interquartile range (IQR) | 46.15 |
Descriptive statistics
| Standard deviation | 37.593969 |
|---|---|
| Coefficient of variation (CV) | 0.35626997 |
| Kurtosis | 2.0145411 |
| Mean | 105.52101 |
| Median Absolute Deviation (MAD) | 21.6 |
| Skewness | 0.56285404 |
| Sum | 2261104.3 |
| Variance | 1413.3065 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 65 | 434 | 2.0% |
| 0 | 368 | 1.7% |
| 75 | 343 | 1.6% |
| 95 | 276 | 1.3% |
| 85 | 271 | 1.3% |
| 90 | 253 | 1.2% |
| 80.75 | 239 | 1.1% |
| 94.5 | 221 | 1.0% |
| 96.3 | 211 | 1.0% |
| 76.5 | 195 | 0.9% |
| Other values (3509) | 18617 |
| Value | Count | Frequency (%) |
| 0 | 368 | |
| 0.5 | 1 | < 0.1% |
| 1 | 6 | < 0.1% |
| 1.48 | 1 | < 0.1% |
| 1.6 | 1 | < 0.1% |
| 2 | 5 | < 0.1% |
| 3 | 2 | < 0.1% |
| 4.5 | 1 | < 0.1% |
| 6 | 17 | 0.1% |
| 6.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 365 | 1 | < 0.1% |
| 349.63 | 1 | < 0.1% |
| 332.57 | 1 | < 0.1% |
| 316 | 1 | < 0.1% |
| 314.1 | 1 | < 0.1% |
| 306 | 2 | |
| 300 | 4 | |
| 299.33 | 1 | < 0.1% |
| 297 | 1 | < 0.1% |
| 296 | 1 | < 0.1% |
no_of_special_requests
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.73819302 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 9925 |
| Zeros (%) | 46.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 334.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.81238784 |
|---|---|
| Coefficient of variation (CV) | 1.1005087 |
| Kurtosis | 0.39371347 |
| Mean | 0.73819302 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.90896341 |
| Sum | 15818 |
| Variance | 0.659974 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 9925 | |
| 1 | 7803 | |
| 2 | 3150 | 14.7% |
| 3 | 490 | 2.3% |
| 4 | 55 | 0.3% |
| 5 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 9925 | |
| 1 | 7803 | |
| 2 | 3150 | 14.7% |
| 3 | 490 | 2.3% |
| 4 | 55 | 0.3% |
| 5 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 5 | < 0.1% |
| 4 | 55 | 0.3% |
| 3 | 490 | 2.3% |
| 2 | 3150 | 14.7% |
| 1 | 7803 | |
| 0 | 9925 |
booking_status
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| Not_Canceled | |
|---|---|
| Canceled |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 10.852716 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not_Canceled |
|---|---|
| 2nd row | Canceled |
| 3rd row | Not_Canceled |
| 4th row | Not_Canceled |
| 5th row | Canceled |
Common Values
| Value | Count | Frequency (%) |
| Not_Canceled | 15282 | |
| Canceled | 6146 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not_canceled | 15282 | |
| canceled | 6146 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 42856 | |
| C | 21428 | |
| a | 21428 | |
| n | 21428 | |
| c | 21428 | |
| l | 21428 | |
| d | 21428 | |
| N | 15282 | 6.6% |
| o | 15282 | 6.6% |
| t | 15282 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 180560 | |
| Uppercase Letter | 36710 | 15.8% |
| Connector Punctuation | 15282 | 6.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 42856 | |
| a | 21428 | |
| n | 21428 | |
| c | 21428 | |
| l | 21428 | |
| d | 21428 | |
| o | 15282 | 8.5% |
| t | 15282 | 8.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 21428 | |
| N | 15282 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 15282 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 217270 | |
| Common | 15282 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 42856 | |
| C | 21428 | |
| a | 21428 | |
| n | 21428 | |
| c | 21428 | |
| l | 21428 | |
| d | 21428 | |
| N | 15282 | 7.0% |
| o | 15282 | 7.0% |
| t | 15282 | 7.0% |
Common
| Value | Count | Frequency (%) |
| _ | 15282 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 232552 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 42856 | |
| C | 21428 | |
| a | 21428 | |
| n | 21428 | |
| c | 21428 | |
| l | 21428 | |
| d | 21428 | |
| N | 15282 | 6.6% |
| o | 15282 | 6.6% |
| t | 15282 | 6.6% |
Interactions
Correlations
| arrival_date | arrival_month | arrival_year | avg_price_per_room | booking_status | lead_time | market_segment_type | no_of_adults | no_of_children | no_of_previous_bookings_not_canceled | no_of_previous_cancellations | no_of_special_requests | no_of_week_nights | no_of_weekend_nights | repeated_guest | required_car_parking_space | room_type_reserved | type_of_meal_plan | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| arrival_date | 1.000 | -0.023 | 0.050 | 0.014 | 0.026 | 0.024 | 0.019 | 0.029 | 0.025 | -0.006 | -0.012 | -0.001 | -0.001 | 0.006 | 0.026 | 0.000 | 0.018 | 0.020 |
| arrival_month | -0.023 | 1.000 | 0.365 | 0.040 | 0.178 | 0.075 | 0.071 | 0.085 | -0.002 | -0.003 | -0.003 | 0.119 | 0.035 | 0.014 | 0.086 | 0.062 | 0.050 | 0.040 |
| arrival_year | 0.050 | 0.365 | 1.000 | 0.192 | 0.153 | 0.171 | 0.135 | 0.113 | 0.028 | 0.014 | 0.027 | 0.040 | 0.015 | 0.037 | 0.010 | 0.000 | 0.098 | 0.105 |
| avg_price_per_room | 0.014 | 0.040 | 0.192 | 1.000 | 0.173 | -0.001 | 0.431 | 0.208 | 0.268 | -0.203 | -0.114 | 0.198 | 0.015 | -0.013 | 0.279 | 0.068 | 0.305 | 0.146 |
| booking_status | 0.026 | 0.178 | 0.153 | 0.173 | 1.000 | 0.371 | 0.215 | 0.097 | 0.059 | 0.058 | 0.042 | 0.267 | 0.125 | 0.087 | 0.111 | 0.087 | 0.079 | 0.052 |
| lead_time | 0.024 | 0.075 | 0.171 | -0.001 | 0.371 | 1.000 | 0.124 | 0.119 | 0.028 | -0.211 | -0.102 | 0.016 | 0.298 | 0.177 | 0.162 | 0.044 | 0.062 | 0.088 |
| market_segment_type | 0.019 | 0.071 | 0.135 | 0.431 | 0.215 | 0.124 | 1.000 | 0.219 | 0.053 | 0.173 | 0.112 | 0.169 | 0.097 | 0.083 | 0.548 | 0.111 | 0.141 | 0.158 |
| no_of_adults | 0.029 | 0.085 | 0.113 | 0.208 | 0.097 | 0.119 | 0.219 | 1.000 | 0.187 | 0.085 | 0.050 | 0.097 | 0.094 | 0.068 | 0.289 | 0.012 | 0.338 | 0.089 |
| no_of_children | 0.025 | -0.002 | 0.028 | 0.268 | 0.059 | 0.028 | 0.053 | 0.187 | 1.000 | -0.049 | -0.033 | 0.108 | 0.011 | 0.010 | 0.033 | 0.022 | 0.403 | 0.046 |
| no_of_previous_bookings_not_canceled | -0.006 | -0.003 | 0.014 | -0.203 | 0.058 | -0.211 | 0.173 | 0.085 | -0.049 | 1.000 | 0.445 | -0.026 | -0.140 | -0.090 | 0.543 | 0.050 | 0.036 | 0.018 |
| no_of_previous_cancellations | -0.012 | -0.003 | 0.027 | -0.114 | 0.042 | -0.102 | 0.112 | 0.050 | -0.033 | 0.445 | 1.000 | -0.025 | -0.059 | -0.037 | 0.386 | 0.016 | 0.046 | 0.012 |
| no_of_special_requests | -0.001 | 0.119 | 0.040 | 0.198 | 0.267 | 0.016 | 0.169 | 0.097 | 0.108 | -0.026 | -0.025 | 1.000 | 0.043 | 0.015 | 0.055 | 0.078 | 0.059 | 0.036 |
| no_of_week_nights | -0.001 | 0.035 | 0.015 | 0.015 | 0.125 | 0.298 | 0.097 | 0.094 | 0.011 | -0.140 | -0.059 | 0.043 | 1.000 | 0.076 | 0.140 | 0.059 | 0.048 | 0.053 |
| no_of_weekend_nights | 0.006 | 0.014 | 0.037 | -0.013 | 0.087 | 0.177 | 0.083 | 0.068 | 0.010 | -0.090 | -0.037 | 0.015 | 0.076 | 1.000 | 0.088 | 0.054 | 0.023 | 0.034 |
| repeated_guest | 0.026 | 0.086 | 0.010 | 0.279 | 0.111 | 0.162 | 0.548 | 0.289 | 0.033 | 0.543 | 0.386 | 0.055 | 0.140 | 0.088 | 1.000 | 0.103 | 0.084 | 0.074 |
| required_car_parking_space | 0.000 | 0.062 | 0.000 | 0.068 | 0.087 | 0.044 | 0.111 | 0.012 | 0.022 | 0.050 | 0.016 | 0.078 | 0.059 | 0.054 | 0.103 | 1.000 | 0.030 | 0.029 |
| room_type_reserved | 0.018 | 0.050 | 0.098 | 0.305 | 0.079 | 0.062 | 0.141 | 0.338 | 0.403 | 0.036 | 0.046 | 0.059 | 0.048 | 0.023 | 0.084 | 0.030 | 1.000 | 0.163 |
| type_of_meal_plan | 0.020 | 0.040 | 0.105 | 0.146 | 0.052 | 0.088 | 0.158 | 0.089 | 0.046 | 0.018 | 0.012 | 0.036 | 0.053 | 0.034 | 0.074 | 0.029 | 0.163 | 1.000 |
Missing values
Sample
| no_of_adults | no_of_children | no_of_weekend_nights | no_of_week_nights | type_of_meal_plan | required_car_parking_space | room_type_reserved | lead_time | arrival_year | arrival_month | arrival_date | market_segment_type | repeated_guest | no_of_previous_cancellations | no_of_previous_bookings_not_canceled | avg_price_per_room | no_of_special_requests | booking_status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2 | 0 | 0 | 2 | Not Selected | 0 | Room_Type 1 | 12 | 2018 | 1 | 22 | Online | 0 | 0 | 0 | 75.00 | 0 | Not_Canceled |
| 1 | 1 | 0 | 2 | 1 | Meal Plan 1 | 0 | Room_Type 1 | 1 | 2018 | 2 | 28 | Online | 0 | 0 | 0 | 60.00 | 0 | Canceled |
| 2 | 2 | 0 | 1 | 4 | Meal Plan 1 | 0 | Room_Type 1 | 141 | 2018 | 7 | 13 | Offline | 0 | 0 | 0 | 72.25 | 2 | Not_Canceled |
| 3 | 3 | 0 | 2 | 2 | Not Selected | 0 | Room_Type 1 | 135 | 2018 | 7 | 15 | Online | 0 | 0 | 0 | 92.28 | 2 | Not_Canceled |
| 4 | 2 | 0 | 0 | 2 | Meal Plan 1 | 0 | Room_Type 1 | 245 | 2018 | 6 | 17 | Online | 0 | 0 | 0 | 75.00 | 0 | Canceled |
| 5 | 2 | 1 | 2 | 6 | Meal Plan 1 | 0 | Room_Type 1 | 71 | 2018 | 9 | 1 | Online | 0 | 0 | 0 | 150.98 | 3 | Not_Canceled |
| 6 | 3 | 0 | 0 | 2 | Meal Plan 1 | 0 | Room_Type 4 | 93 | 2018 | 7 | 12 | Online | 0 | 0 | 0 | 137.70 | 2 | Not_Canceled |
| 7 | 2 | 0 | 1 | 3 | Meal Plan 1 | 0 | Room_Type 1 | 17 | 2018 | 12 | 8 | Online | 0 | 0 | 0 | 100.38 | 0 | Not_Canceled |
| 8 | 2 | 0 | 0 | 3 | Meal Plan 1 | 0 | Room_Type 4 | 50 | 2018 | 9 | 1 | Online | 0 | 0 | 0 | 98.64 | 0 | Not_Canceled |
| 9 | 1 | 0 | 1 | 1 | Meal Plan 2 | 0 | Room_Type 1 | 301 | 2018 | 7 | 30 | Offline | 0 | 0 | 0 | 90.00 | 0 | Not_Canceled |
| no_of_adults | no_of_children | no_of_weekend_nights | no_of_week_nights | type_of_meal_plan | required_car_parking_space | room_type_reserved | lead_time | arrival_year | arrival_month | arrival_date | market_segment_type | repeated_guest | no_of_previous_cancellations | no_of_previous_bookings_not_canceled | avg_price_per_room | no_of_special_requests | booking_status | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 29008 | 1 | 0 | 1 | 1 | Meal Plan 1 | 0 | Room_Type 4 | 11 | 2018 | 10 | 3 | Online | 0 | 0 | 0 | 130.90 | 1 | Not_Canceled |
| 29009 | 1 | 0 | 0 | 1 | Meal Plan 1 | 0 | Room_Type 1 | 0 | 2018 | 2 | 3 | Online | 0 | 0 | 0 | 79.00 | 0 | Not_Canceled |
| 29012 | 2 | 0 | 1 | 3 | Not Selected | 0 | Room_Type 1 | 39 | 2018 | 11 | 7 | Online | 0 | 0 | 0 | 148.00 | 0 | Canceled |
| 29013 | 2 | 0 | 2 | 3 | Meal Plan 1 | 0 | Room_Type 1 | 148 | 2018 | 5 | 6 | Online | 0 | 0 | 0 | 99.45 | 0 | Not_Canceled |
| 29014 | 1 | 1 | 0 | 1 | Meal Plan 1 | 0 | Room_Type 1 | 5 | 2017 | 9 | 2 | Offline | 0 | 0 | 0 | 60.00 | 1 | Not_Canceled |
| 29015 | 2 | 0 | 0 | 4 | Meal Plan 1 | 0 | Room_Type 5 | 19 | 2017 | 9 | 2 | Offline | 0 | 0 | 0 | 83.55 | 1 | Not_Canceled |
| 29016 | 2 | 0 | 2 | 3 | Meal Plan 1 | 0 | Room_Type 1 | 26 | 2018 | 7 | 3 | Offline | 0 | 0 | 0 | 85.00 | 0 | Not_Canceled |
| 29017 | 2 | 1 | 1 | 3 | Meal Plan 2 | 0 | Room_Type 1 | 150 | 2018 | 7 | 7 | Online | 0 | 0 | 0 | 173.25 | 0 | Canceled |
| 29018 | 2 | 1 | 0 | 2 | Meal Plan 1 | 0 | Room_Type 1 | 127 | 2018 | 12 | 22 | Online | 0 | 0 | 0 | 106.20 | 3 | Not_Canceled |
| 29019 | 2 | 0 | 0 | 1 | Not Selected | 0 | Room_Type 1 | 3 | 2018 | 4 | 19 | Online | 0 | 0 | 0 | 89.00 | 2 | Not_Canceled |